Smart Paper Notes

Clustering and Analysis of GPS Trajectory Data using Distance-based Features

Zann Koh , Yuren Zhou , Billy Pik Lik Lau , Ran Liu , Keng Hua Chong , Chau Yuen

Category: Machine Learning

2022-12-01

The proliferation of smartphones has accelerated mobility studies by greatly increasing the types and volume of mobility data available. One such source is GPS technology, which is becoming increasingly common and helps the research community understand people's mobility patterns. However, there is no standardized framework for using machine learning methods to study the different mobility patterns created by the non-Work, non-Home locations of Working and Nonworking users on Workdays and Offdays. We propose a new mobility metric, Daily Characteristic Distance, and use it together with Origin-Destination matrix features to generate features for each user. We then apply an unsupervised machine learning method, $k$-means clustering, to those features and obtain three clusters of users for each type of day (Workday and Offday). Finally, we propose two new metrics for analyzing the clustering results, namely User Commonality and Average Frequency. The proposed metrics reveal interesting user behaviors and help us better understand the users' mobility patterns.
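The clustering step can be illustrated with a minimal sketch of $k$-means (Lloyd's algorithm) with three clusters. The per-user feature values, the two-column feature layout, and the farthest-point initialisation are assumptions made for illustration only; the abstract does not specify how the features are computed or how the clustering is initialised.

```python
import math


def kmeans(points, k, max_iterations=100):
    """Plain Lloyd's algorithm. Returns (labels, centroids)."""
    # Farthest-point initialisation: deterministic, chosen here only so the
    # example is reproducible (an assumption, not taken from the paper).
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        )

    labels = None
    for _ in range(max_iterations):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j])) for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the centroids are stable
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = tuple(sum(axis) / len(members) for axis in zip(*members))
    return labels, centroids


# Hypothetical per-user features: (a distance-based feature in km,
# a share of trips to non-Home, non-Work places). Values are invented.
user_features = {
    "user_a": (1.0, 0.1),
    "user_b": (1.2, 0.2),
    "user_c": (10.0, 0.5),
    "user_d": (10.5, 0.4),
    "user_e": (30.0, 0.9),
    "user_f": (29.0, 0.8),
}
points = list(user_features.values())
labels, centroids = kmeans(points, k=3)
clusters = dict(zip(user_features, labels))
```

In practice the features would likely need scaling before clustering, since a distance in kilometres would otherwise dominate a share between 0 and 1.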

GreenPLM: Cross-lingual pre-trained language models conversion with (almost) no cost

Qingcheng Zeng , Lucas Garay , Peilin Zhou , Dading Chong , Yining Hua , Jiageng Wu , Yikang Pan , Han Zhou , Jie Yang

Category: Natural Language Processing

2022-11-13

While large pre-trained models have transformed the field of natural language processing (NLP), the high training cost and low cross-lingual availability of such models prevent the new advances from being equally shared by users across all languages, especially the less spoken ones. To promote equal opportunities for all language speakers in NLP research and to reduce energy consumption for sustainability, this study proposes an effective and energy-efficient framework, GreenPLM, that uses bilingual lexicons to directly translate language models of one language into other languages at (almost) no additional cost. We validate this approach in 18 languages and show that this framework is comparable to, if not better than, other heuristics trained with high cost. In addition, when given a low computational cost (2.5%), the framework outperforms the original monolingual language models in six out of seven tested languages. We release language models in 50 languages translated from English and the source code here.
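The idea of translating a model through a bilingual lexicon can be sketched as relabelling the embedding table: each source-language token's vector is reused under its target-language translation. This is a toy illustration under assumed data structures (plain dictionaries, invented vectors, a two-entry lexicon) and a simple collision rule. It is not the released GreenPLM code.

```python
def translate_embeddings(source_embeddings, lexicon):
    """Build a target-language embedding table by reusing source vectors.

    source_embeddings: dict mapping source token -> vector (list of floats)
    lexicon: dict mapping source token -> target-language token
    Tokens missing from the lexicon (punctuation, digits) keep their own form.
    """
    target_embeddings = {}
    for token, vector in source_embeddings.items():
        target_token = lexicon.get(token, token)
        # If two source tokens map to the same target token, keep the first one.
        # This collision rule is an assumption made for the sketch.
        target_embeddings.setdefault(target_token, list(vector))
    return target_embeddings


# Invented toy data: a three-token English table and an English-German lexicon.
english = {"cat": [0.1, 0.9], "dog": [0.8, 0.2], "!": [0.0, 0.0]}
en_to_de = {"cat": "Katze", "dog": "Hund"}

german = translate_embeddings(english, en_to_de)
```

Because only the vocabulary labels change, the rest of the network's weights can be reused as-is, which is consistent with the "(almost) no additional cost" claim in the abstract.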

METS-CoV: A Dataset of Medical Entity and Targeted Sentiment on COVID-19 Related Tweets

Peilin Zhou , Zeqiang Wang , Dading Chong , Zhijiang Guo , Yining Hua , Zichang Su , Zhiyang Teng , Jiageng Wu , Jie Yang

Category: Natural Language Processing

2022-09-28

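The COVID-19 pandemic continues to raise a variety of topics that are discussed or debated on social media. To explore the impact of the pandemic on people's lives, it is crucial to understand the public's concerns and attitudes toward pandemic-related entities (e.g., drugs, vaccines) on social media. However, models trained on existing named entity recognition (NER) or targeted sentiment analysis (TSA) datasets have limited ability to understand COVID-related social media texts, because these datasets were not designed or annotated from a medical perspective. This paper releases METS-CoV, a dataset containing medical entities and targeted sentiments from COVID-related tweets. METS-CoV contains 10,000 tweets with 7 types of entities, including 4 medical entity types (Disease, Drug, Symptom, and Vaccine) and 3 general entity types (Person, Location, and Organization). To further investigate tweet users' attitudes toward specific entities, 4 types of entities (Person, Organization, Drug, and Vaccine) are selected and annotated with user sentiments, resulting in a targeted sentiment dataset with 9,101 entities (in 5,278 tweets). To the best of our knowledge, METS-CoV is the first dataset to collect medical entities and corresponding sentiments from COVID-related tweets. We benchmark classical machine learning models and state-of-the-art deep learning models on the NER and TSA tasks through extensive experiments. The results show that the dataset leaves substantial room for improvement on both tasks. METS-CoV is an important resource for developing better medical social media tools and facilitating computational social science research, especially in epidemiology. Our data, annotation guidelines, benchmark models, and source code are publicly available (https://github.com/ylab-open/mets-cov) to ensure reproducibility.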

OmniVL: One Foundation Model for Image-Language and Video-Language Tasks

Junke Wang , Dongdong Chen , Zuxuan Wu , Chong Luo , Luowei Zhou , Yucheng Zhao , Yujia Xie , Ce Liu , Yu-Gang Jiang , Lu Yuan

Category: Computer Vision

2022-09-15

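This paper presents OmniVL, a new foundation model that supports both image-language and video-language tasks using one universal architecture. It adopts a unified transformer-based visual encoder for both image and video inputs, and can therefore perform joint image-language and video-language pretraining. We demonstrate, for the first time, that such a paradigm benefits both image and video tasks, as opposed to the conventional one-directional transfer (e.g., using image-language to help video-language). To this end, we propose a decoupled joint pretraining of image-language and video-language to effectively decompose vision-language modeling into spatial and temporal dimensions and obtain a performance boost on both image and video tasks. Moreover, we introduce a novel unified vision-language contrastive (UniVLC) loss to leverage image-text, video-text, image-label (e.g., image classification), and video-label (e.g., video action recognition) data together, so that both supervised and noisily supervised pretraining data are utilized as much as possible. Without requiring extra task-specific adaptors, OmniVL can simultaneously support visual-only tasks (e.g., image classification, video action recognition), cross-modal alignment tasks (e.g., image/video-text retrieval), and multi-modal understanding and generation tasks (e.g., image/video question answering, captioning). We evaluate OmniVL on a wide range of downstream tasks and achieve state-of-the-art or competitive results with similar model size and data scale.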

Learning Ball-balancing Robot Through Deep Reinforcement Learning

Yifan Zhou , Jianghao Lin , Shuai Wang , Chong Zhang

Category: Robotics

2022-08-22

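The ball-balancing robot (ballbot) is a good platform for testing the effectiveness of a balancing controller. For balancing control, conventional model-based feedback control methods have been widely used. However, contacts and collisions are difficult to model and often cause balancing control to fail, especially when the ballbot tilts at a large angle. To explore the maximum initial tilt angle of the ballbot, balancing control is formulated as a recovery task using reinforcement learning (RL). RL is a powerful technique for systems that are difficult to model, because it allows an agent to learn a policy by interacting with the environment. In this paper, a compound controller is proposed that combines a conventional feedback controller with an RL method. We show the effectiveness of the compound controller by training an agent to successfully perform a recovery task involving contacts and collisions. Simulation results demonstrate that, compared with a conventional model-based controller, the compound controller keeps the ballbot balanced over a larger set of initial tilt angles.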
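One common way to combine a feedback controller with a learned policy is to add the policy's output to the feedback command as a correction. The sketch below assumes that additive form, a PD feedback law, a single tilt axis, and invented gains and torque limit. The abstract does not state how the two controllers are actually combined, so this is an illustration rather than the paper's design.

```python
def pd_feedback(tilt, tilt_rate, kp=20.0, kd=2.0):
    """Conventional PD feedback: push against the tilt and its rate of change."""
    return -(kp * tilt + kd * tilt_rate)


def compound_action(tilt, tilt_rate, policy, max_torque=5.0):
    """Feedback command plus a learned correction, clipped to actuator limits.

    policy: any callable (tilt, tilt_rate) -> torque correction. In a real
    system this would be a trained RL agent; here it is a placeholder.
    """
    command = pd_feedback(tilt, tilt_rate) + policy(tilt, tilt_rate)
    return max(-max_torque, min(max_torque, command))


def zero_policy(tilt, tilt_rate):
    """Untrained stand-in: contributes nothing, leaving pure PD control."""
    return 0.0


def toy_policy(tilt, tilt_rate):
    """Hypothetical learned correction with a constant output."""
    return 1.0


small_tilt_pd_only = compound_action(0.125, 0.0, zero_policy)
small_tilt_compound = compound_action(0.125, 0.0, toy_policy)
large_tilt_clipped = compound_action(1.0, 0.0, zero_policy)
```

With the zero policy the compound controller reduces to the PD law, which makes the learned part easy to switch off when comparing against the model-based baseline.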

CTooth+: A Large-scale Dental Cone Beam Computed Tomography Dataset and Benchmark for Tooth Volume Segmentation

Weiwei Cui , Yaqi Wang , Yilong Li , Dan Song , Xingyong Zuo , Jiaojiao Wang , Yifan Zhang , Huiyu Zhou , Bung san Chong , Liaoyuan Zeng

Category: Artificial Intelligence | Computer Vision

2022-08-02

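Accurate tooth volume segmentation is a prerequisite for computer-aided dental analysis. Deep learning-based tooth segmentation methods have achieved satisfactory performance but require a large quantity of tooth data. Publicly available dental data is limited, which means existing methods cannot be reproduced, evaluated, or applied in clinical practice. In this paper, we establish a 3D dental CBCT dataset, CTooth+, with 22 fully annotated volumes and 146 unlabeled volumes. We further evaluate several state-of-the-art tooth volume segmentation strategies based on fully supervised learning, semi-supervised learning, and active learning, and define the performance principles. This work provides a new benchmark for the tooth volume segmentation task, and the experiments can serve as a baseline for future AI-based dental imaging research and clinical application development.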

Learning Prototype via Placeholder for Zero-shot Recognition

Zaiquan Yang , Yang Liu , Wenjia Xu , Chong Huang , Lei Zhou , Chao Tong

Category: Computer Vision

2022-07-29

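Zero-shot learning (ZSL) aims to recognize unseen classes by exploiting semantic descriptions shared between seen and unseen classes. Current methods show that it is effective to learn visual-semantic alignment by projecting semantic embeddings into the visual space as class prototypes. However, such a projection function is only concerned with seen classes. When applied to unseen classes, the prototypes are often suboptimal because of domain shift. In this paper, we propose to learn prototypes via placeholders, termed LPL, to eliminate the domain shift between seen and unseen classes. Specifically, we combine seen classes to hallucinate new classes that act as placeholders for the unseen classes in the visual and semantic spaces. Placed between seen classes, the placeholders encourage the prototypes of seen classes to be highly dispersed, which leaves more room for inserting well-separated unseen prototypes. Empirically, well-separated prototypes help counteract the visual-semantic misalignment caused by domain shift. Furthermore, we exploit a novel semantic-oriented fine-tuning to guarantee the semantic reliability of the placeholders. Extensive experiments on five benchmark datasets demonstrate the significant performance gain of LPL over state-of-the-art methods. Code is available at https://github.com/zaiquanyang/lpl.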
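The placeholder idea can be sketched as combining two seen classes with the same weight in both the visual and the semantic space, so the new class sits between them in each. Linear interpolation, the dictionary layout, and the toy vectors below are all assumptions for illustration. The abstract says only that seen classes are combined and does not state the rule.

```python
def make_placeholder(class_a, class_b, weight=0.5):
    """Interpolate two seen classes in the visual and semantic spaces.

    class_a, class_b: dicts with "visual" and "semantic" vectors (lists of floats).
    weight: share taken from class_a; the same value is used in both spaces
            so the placeholder stays aligned across them.
    """
    def mix(u, v):
        return [weight * x + (1.0 - weight) * y for x, y in zip(u, v)]

    return {
        "visual": mix(class_a["visual"], class_b["visual"]),
        "semantic": mix(class_a["semantic"], class_b["semantic"]),
    }


# Invented toy classes: 2-D visual features, 3-D semantic (attribute) vectors.
horse = {"visual": [1.0, 0.0], "semantic": [2.0, 0.0, 4.0]}
tiger = {"visual": [0.0, 1.0], "semantic": [0.0, 2.0, 0.0]}

placeholder = make_placeholder(horse, tiger)
```

Training against such in-between classes pushes the seen prototypes apart, which is the dispersion effect the abstract describes.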

Low-resource Accent Classification in Geographically-proximate Settings: A Forensic and Sociophonetics Perspective

Qingcheng Zeng , Dading Chong , Peilin Zhou , Jie Yang

Category: Natural Language Processing

2022-06-26

Accented speech recognition and accent classification are relatively under-explored research areas in speech technology. Recently, deep learning-based methods and Transformer-based pretrained models have achieved strong performance in both areas. However, most accent classification work has focused on classifying different kinds of English accents, and little attention has been paid to geographically proximate accent classification, especially in the low-resource settings that forensic speech science tasks usually encounter. In this paper, we explore three main accent modelling methods combined with two different classifiers, based on 105 speaker recordings retrieved from five urban varieties in Northern England. Although speech representations generated from pretrained models generally perform better in downstream classification, traditional methods such as Mel Frequency Cepstral Coefficients (MFCCs) and formant measurements have specific strengths of their own. These results suggest that in forensic phonetics scenarios where data are relatively scarce, a simple modelling method and classifier could be competitive with state-of-the-art pretrained speech models used as feature extractors, which could enable faster estimation of accent information in practice. In addition, our findings cross-validate a new methodology for quantifying sociophonetic changes.

Plasticity Neural Network Based on Astrocytic Influence at Critical Period, Synaptic Competition and Compensation by Current and Mnemonic Brain Plasticity and Synapse Formation

Jun-Bo Tao , Bai-Qing Sun , Wei-Dong Zhu , Shi-You Qu , Ling-Kun Chen , Jia-Qiang Li , Chong Wu , Yu Xiong , Jiaxuan Zhou

Category: Neural and Evolutionary Computing | Machine Learning | Machine Learning (Statistics)

2022-03-19

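The mechanism of our NN is closely in line with the results of the latest MIT brain plasticity study, in which researchers found that as a synapse strengthens, neighboring synapses automatically weaken themselves to compensate. Regarding the importance of this mechanism, Dr. Luo's team at Stanford University has stated that competition in synapse formation is important for dendritic morphogenesis. In contrast with previous studies, we use a model to investigate in detail the mechanism by which brain plasticity fails when the critical period closes. Cutting-edge imaging and genetic tools are combined in their experimental studies, whereas our research puts more emphasis on the model, derivation, and simulation of the new NN. In tests, it is demonstrated that the generation of dendrites is, to a certain extent, curbed by synapse formation. Current and mnemonic brain plasticity, as well as the synaptic action range, are also taken into account in the study. Furthermore, the framework of the new NN is based on synapse formation driven by current gradient information and by mnemonic negative and positive gradient information. The mnemonic gradient information needs to consider the forgotten memory, that is, the astrocytic synapse formation memory persistence factor (including both negative and positive memories, which are the remote and relatively inferior gradient information). We found that, like the phagocytosis factor, the astrocytic memory persistence factor has the effect of reducing the local accumulation of synapses. The PNN that considers only synaptic phagocytosis regardless of the gradient update, and in which whether synaptic phagocytosis is cancelled for different variables and synapse positions is determined by the correlation coefficient over the relevant time interval, proves simple and effective.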

DenseCLIP: Extract Free Dense Labels from CLIP

Chong Zhou , Chen Change Loy , Bo Dai

Category: Computer Vision | Natural Language Processing

2021-12-02

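Contrastive Language-Image Pre-training (CLIP) has made a remarkable breakthrough in open-vocabulary zero-shot image recognition. Many recent studies leverage pre-trained CLIP models for image-level classification and manipulation. In this paper, we further explore the potential of CLIP for pixel-level dense prediction, specifically semantic segmentation. Without annotations or fine-tuning, our method, DenseCLIP, yields reasonable segmentation results on open concepts across various datasets. By adding pseudo labeling and self-training, DenseCLIP+ surpasses SOTA transductive zero-shot semantic segmentation methods by large margins; for example, the mIoU of unseen classes on PASCAL VOC / PASCAL Context / COCO Stuff improves from 35.6 / 20.7 / 30.3 to 86.1 / 66.7 / 54.7. We also test the robustness of DenseCLIP under input corruption and evaluate its ability to discriminate fine-grained objects and novel concepts. Our findings suggest that DenseCLIP can serve as a new, reliable source of supervision for dense prediction tasks to achieve annotation-free segmentation.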
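Pixel-level prediction from a vision-language model can be pictured as zero-shot classification repeated at every spatial location: each pixel's feature vector is compared with one text embedding per class name, and the best match wins. The sketch below uses invented 2-D vectors in place of real CLIP features and text embeddings, and cosine similarity as the matching score. It illustrates the general idea only, not the DenseCLIP implementation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def segment(pixel_features, text_embeddings):
    """Label every pixel with the class whose text embedding matches it best.

    pixel_features: H x W grid (list of rows) of feature vectors
    text_embeddings: dict mapping class name -> embedding of that name
    """
    return [
        [
            max(text_embeddings, key=lambda name: cosine(feature, text_embeddings[name]))
            for feature in row
        ]
        for row in pixel_features
    ]


# Invented stand-ins for per-pixel image features and class-name embeddings.
text_embeddings = {"sky": [1.0, 0.0], "grass": [0.0, 1.0]}
pixel_features = [
    [[0.9, 0.1], [0.8, 0.3]],
    [[0.2, 0.7], [0.1, 0.9]],
]

label_map = segment(pixel_features, text_embeddings)
```

Because the class list is just a dictionary of text embeddings, new concepts can be added without retraining, which mirrors the open-vocabulary setting described in the abstract.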
